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ABSTRACT 


The Global Precipitation Climatology Project (GPCP) has released the GPCP Version 1 Combined Precipitation 
Data Set, a global, monthly precipitation dataset covering the period July 1987 through December 1995. The primary 
product in the dataset is a merged analysis incorporating precipitation estimates from low-orbit-satellite microwave 
data, geosynchronous-orbit-satellite infrared data, and rain gauge observations. The dataset also contains the individual 
input fields, a combination of the microwave and infrared satellite estimates, and error estimates for each field. The 
data are provided on 2.5° x 2.5° latitude-longitude global grids. Preliminary analyses show general agreement with 
prior studies of global precipitation and extends prior studies of El Nino-Southern Oscillation precipitation patterns. At 
the regional scale there are systematic differences with standard climatologies. 


1. Introduction 

The latent heating associated with precipitation is 
a primary atmospheric energy source; the spatial and 
temporal distribution of precipitation around the globe 
is crucial information for advancing our ability to 
model and predict global long-range weather and cli- 
mate changes. Furthermore, as reviewed by Arkin and 
Xie (1994), the distribution of precipitation is needed 
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for a variety of other scientific uses, such as climate 
diagnostic studies, and societal applications such as 
water management for agriculture and power, drought 
relief, flood control, and flood forecasting. The task 
of quantifying the distribution is complicated by the 
fact that no single currently available estimate of pre- 
cipitation has the necessary coverage and accuracy 
over the whole globe. 

The Global Precipitation Climatology Project 
(GPCP) was established by the World Climate Re- 
search Program (WCRP) in 1986 to address this prob- 
lem [WCRP (1986); see Arkin and Xie (1994) for a 
summary of its initial objectives and organization]. 
The GPCP’s present goal is to provide monthly mean 
precipitation data on a global 2.5° x 2.5° latitude- 
longitude (lat-long) grid for the period 1986-2000. 
The general approach is to combine the precipitation 
information available from each source into a final 
merged product, taking advantage of the strengths of 
each data type. The microwave estimates are based on 
Special Sensor Microwave/Imager (SSM/I) data from 
the Defense Meteorological Satellite Program (DMSP, 
United States) satellites that fly in sun-synchronous 
low-earth orbits. The infrared (IR) precipitation esti- 
mates are obtained primarily from geostationary sat- 
ellites operated by the United States, Europe, and 
Japan, and secondarily from polar-orbiting satellites. 
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The gauge data are assembled and analyzed by the 
Global Precipitation Climatology Centre (GPCC) of 
the Deutscher Wetterdienst. 

The individual single-source estimates (microwave, 
IR, gauge data) are valuable in their own right and 
have been used in a variety of scientific studies (e.g., 
Janowiak and Arkin 1991; Chang et al. 1995; Weng 
etal. 1994; Rudolf etal. 1996). Each input dataset has 
strengths and weaknesses in estimating global precipi- 
tation. The precipitation estimates made from geosta- 
tionary IR data are based on frequent (3 h) images, 
thereby providing good temporal resolution of precipi- 
tation systems and coverage of the diurnal cycle. 
However, the relation between IR radiance and instan- 
taneous surface precipitation is relatively weak and 
useful primarily for deep convective systems in the 
zone 40°N-S. The SSM/I radiances have a strong 
connection with surface rainfall, especially over the 
ocean, and are useful to much higher latitudes. 
However, the SSM/I observations have poor temporal 
sampling, averaging only 1.2 images day 1 in the 
Tropics and subtropics with one satellite. Gauge re- 
ports are considered very accurate (at least after cor- 
rection for systematic measurement errors), but gauge 
sites are mostly limited to land areas and are concen- 
trated in developed countries. 

Recognizing such shortcomings, the GPCP has 
promoted the development of an analysis procedure 
for blending the various estimates together to produce 
the necessary global gridded precipitation field. The 
current procedure is based on Huffman et al. (1995) 
and has been used to produce the GPCP Version 1 
Combined Precipitation Data Set, covering the period 
July 1987 through 1995. The primary product in the 
Version 1 dataset is a combined observation-only 
dataset, that is, a gridded analysis based on gauge 
measurements and satellite estimates of rainfall. 
It will be useful for climate model validation, hydro- 
logical and climate monitoring, and diagnostic stud- 
ies. Gaps in coverage at high latitudes will be 
addressed in a future release that may include numeri- 
cal weather prediction or additional satellite estimates 
of precipitation. 

The following sections describe the techniques 
used in the GPCP Version 1 dataset and provide 
examples. Section 2 summarizes the input datasets and 
the approach used to arrive at the error estimates for 
those fields. Section 3 describes the technique used 
to combine the individual estimates into the blended 
products. Examples of the products, some preliminary 
research results, and initial validation studies are pre- 


sented in sections 4 and 5. A summary and directions 
on how to obtain the dataset are given in section 6. 

2. Individual dataset summaries 

a. Rain gauge analysis 

Precipitation measurements from rain gauges gen- 
erally have high accuracy. However, they represent 
point values of a highly variable parameter observed 
on a sparse irregular grid and must be converted to 
area means before use in most applications. The 
GPCC uses a variant of the SPHEREMAP interpo- 
lation routine (Willmott et al. 1985) to interpolate the 
station data to regular grid points (0.5° lat-long mesh). 
These regular points are then averaged to provide 
area-mean monthly total precipitation on 2.5° grid 
cells. A detailed description of the GPCC data pro- 
cessing and analysis system is given by Rudolf ( 1 993). 

Quality control is crucial before carrying out the 
analysis because some records are contaminated by 
coding, typing, or transmission errors. The GPCC 
manually reviews station location and has developed 
a two-step quality control for precipitation records. 
The first automatic check on precipitation compares 
individual data values to area means and climatologi- 
cal normals. Questionable data are reviewed in the 
second manual step, which considers orographic con- 
ditions and catalogs of catastrophic events. The sec- 
ond step is key to capturing the extreme events that 
characterize the time— space distribution of precipita- 
tion (Rudolf 1993; Schneider 1993). 

Precipitation measurements from rain gauges are 
also affected by systematic errors, that is, losses due 
to wetting, evaporation, or aerodynamic effects. The 
last is more important for snow than for rain. The 
GPCC uses the climatological estimates of systematic 
error from Legates (1987), which are based on 
Sevruk’s (1989) approach. 

The Version 1 rain gauge product is based on about 
6700 rain gauge stations worldwide, mostly synoptic 
and monthly climate reports collected from the Glo- 
bal Telecommunications Network in real time, supple- 
mented by other worldwide or continental data 
collections. Currently the GPCC is working to as- 
semble and quality control a much larger set of sta- 
tions (about 40 000). 

b. Microwave estimates 

The microwave brightness temperatures (T b ) 
observed from a spaceborne sensor are dependent 
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upon the emission from the earth’s surface and modi- 
fied by the intervening atmosphere, principally 
due to hydrometeors. Since mid- 1987 the SSM/I has 
been the main source of microwave data, providing 
four frequencies (19.35, 22.235, 37, and 85.5 GHz) 
with dual polarization on each frequency (except ver- 
tical alone at 22 GHz). The GPCP has chosen to em- 
ploy two SSM/I algorithms, one for ocean regions 
based on emission and the other for land areas based 
on scattering. 

1 ) Emission estimate 

Water surfaces have low emissivity, so the emis- 
sion signal in the lowest-frequency SSM/I channels 
due to hydrometeors is clearly detectable over oceans. 
In particular, the T b for the vertically polarized 
19.35-GHz channel [T b ( 19v)] can be related to the rain 
intensity over the dynamic range of 2-25 mm h 1 from 
first principles with only a few approximations. 
Following the Wilheit et al. (1991) histogram ap- 
proach to retrieving rainfall, the rain rate is modeled 
as a mixed distribution made up of a discrete prob- 
ability of no rain and a lognormal distribution for rain 
events. Hence, the parameters of the rain-rate prob- 
ability distribution function can be related to the 
lower-frequency SSM/I T b histogram for the month 
and solved iteratively. The SSM/I channel combina- 
tion [2T b (19v)-T b (22v)J is used to minimize the ef- 
fect of varying water vapor in the atmospheric path. 
The height of the freezing level, which is needed by 
the radiative transfer calculation, is determined from 
scattergrams of T b (19v) and T b (22v). The resulting 
SSM/I rain-rate estimates are multiplied by 1 .5 to 
account for the beam-filling bias (Wilheit et al. 1991; 
Chiu et al. 1993). Whenever the method fails to con- 
verge on the nominal 2.5° latx 2.5° long grid, values 
computed on a 5° lat x 5° long grid are substituted. 
The emission estimates are computed by the Labora- 
tory for Hydrospheric Processes at the NASA/Goddard 
Space Flight Center. 

2) Scattering estimate 

Ice hydrometeors scatter upwelling radiant energy 
at the higher SSM/I frequencies, an effect that pro- 
vides strong correlation with surface rainfall and is 
detectable over both land and water. The primary 
GPCP scattering algorithm is an 85-GHz technique 
(Ferraro et al. 1994) based on the Grody (1991 ) scat- 
tering index (SI). It has separate components for land 
and ocean, as well as screening tests for the removal 
of artifacts caused by various surface types. The 


coastline is thickened by 50 km, and the land 
component is applied over both land and “coast.” The 
algorithm has been calibrated to instantaneous rain 
rates from ground-based radar measurements (Ferraro 
and Marks 1995). 

A fail-back scattering technique was also required 
because the 85-GHz sensors in the first operational 
SSM/I (aboard the DMSP F-8 spacecraft) failed by 
June 1990 due to improper shielding from solar ra- 
diation. This failure prevented further use of the 
85-GHz SI until the advent of new SSM/I units aboard 
the DMSP F-10 and F-l 1 spacecraft in late 1991. In 
the interim, an alternate SI was employed that uses 
the scattering signal in T b (37v). The 37-GHz SI was 
developed in parallel with the Grody (1991) study and 
includes a linear calibration with radar data. While the 
85-GHz SI technique can detect rain rates as low as 
1 mm h _l , the 37-GHz SI is only sensitive to rain rates 
of 5 mm h _l or greater. 

The monthly scattering-based rainfall field is com- 
puted by the Office of Research and Application of 
the NOAA National Environmental Satellite Data and 
Information Service as follows. SSM/I pixels are ac- 
cumulated on separate daily 0.333° lat-long grids for 
ascending and descending orbit segments. Then all 
these grids are averaged to a 2.5° lat-long grid for the 
month. 

For completeness, we note that the scattering esti- 
mates over ocean depend on additional processing 
(Weng and Grody 1994). As such, this additional pro- 
cessing solely affects the global scattering estimates 
that appear as the “SSM/I scattering” product in the 
Version 1 dataset, but not any of the other input or 
combination fields. 

3) Microwave merger 

For the most part, the merger of the two microwave 
estimates is simply a case of taking histogram-based 
emission estimates over water and Si-based scattering 
estimates over land. However, the emission technique 
eliminates land-contaminated pixels individually, so 
near-coast emission estimates for the month can be the 
result of relatively few pixels. Some preliminary study 
led us to take the emission estimate alone when it has 
at least 75% of the sampling of the scattering tech- 
nique, and at lower sampling values do a weighted in- 
terpolation between the two as 


D _ ^emiss^einiss (^scat ^emiss)^cal 

1 combined » T , (1) 


N, 


scat 
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where R is the precipitation rate and N is the number 
of samples, and “combined,” “emiss,” and “scat” denote 
combined, emission, and scattering estimates, respec- 
tively. The somewhat low threshold of 75% was cho- 
sen because the two techniques count samples 
differently. 

c. IR estimate 

Each cooperating geostationary satellite operator 
(the Geostationary Operational Environmental Satel- 
lites, or GOES, United States; the Geostationary Me- 
teorological Satellite, or GMS, Japan; and the 
Meteorological Satellite, or Meteosat, European Com- 
munity) accumulates 3-h geo-IR imagery into 16-class 
histograms of IR T b on a 2.5° lat-long grid in the zone 
40°N-S for each pentad of days (1-5 January, 6- 
10 January, . . 27-31 December). Separate histo- 
grams are accumulated for each 3-h period of the day 
(0000, 0300, . . ., 2100 UTC), which preserves the 
mean diurnal cycle for each pentad. The global IR 
rainfall estimates are then generated from a merger 
of these data at the Climate Prediction Center using 
the GOES precipitation index (GPI; Arkin and 
Meisner 1987) technique, which relates cold cloud- 
top area to rain rate. 

The original geo-IR data contain viewing-geometry 
effects, which impact the GPI estimates. The geo- 
metrical effects result in increased apparent cloud 
cover, correspondingly lower brightness temperatures, 
and erroneously high rainfall rates in regions that are 
far from the satellite nadir point. Empirical corrections 
have been developed for this effect (R. Joyce and 
P. Arkin 1997, submitted to J. Atmos. Oceanic Technol.) 
and applied to the GPI estimates used here. The GPI 
was not corrected in this study for the less-severe ef- 
fects of intersatellite calibration, although R. Joyce 
and P. Arkin (1997, submitted to J. Atmos. Oceanic 
Technol.) have recently developed such corrections. 

In cases where geostationary data are unavailable, 
the National Oceanic and Atmospheric Administra- 
tion (NOAA) polar-orbiting satellites are used to pro- 
vide three-class histograms of brightness temperature 
(Janowiak and Arkin 1991) or, when histograms are 
not available, just the integrated value of outgoing 
longwave radiation. GPI is derived from the histo- 
grams, and an estimate calibrated by GPI is gener- 
ated by linear regression from the flux values 
(Janowiak and Arkin 1991). Depending on the satel- 
lites contributing, the NOAA data provide 0-4 im- 
ages per day, at most half the samples available from 
the geostationary data. 


d. Error estimates 

It is critical to our understanding of the data, as well 
as to the combination approach described below, to 
have estimates of errors that vary with space and time 
for each precipitation field. Unfortunately, a general 
statement of the problem (see North et al. 1991) makes 
it clear that we will have to accept some strong ap- 
proximations to achieve useful results with the tech- 
niques and data available. 

The first step is to neglect bias error and only com- 
pute random errors. This approximation is justified be- 
cause experience shows that random errors usually 
dominate bias for monthly 2.5° grids. In addition, the 
combination technique adjusts biases in individual 
fields before combination, where possible. The sec- 
ond step is to develop functional representations 
of random error. This task is difficult because the 
various input data sources have such disparate behav- 
ior in any statistical properties beyond precipitation 
total. The final form for this study, described by 
G. Huffman (1997, submitted to J. Appl. Meteor.), is 

g , s H( F + S )[24 + 49 ^] (2) 

where cr 2 is the error variance of an average over a 
finite set of observations, H is taken as constant (ac- 
tually slightly dependent on the shape of the precipi- 
tation rate histogram), r is the average precipitation 
rate (in mm day 1 ), S is taken as constant (actually a 
for r = 0), N. is the number of “independent” samples 
in the set of observations, and the expression in square 
brackets is a parameterization of the conditional pre- 
cipitation rate based on work with the Goddard Scat- 
tering Algorithm, Version 2 (Adler et al. 1994) and 
fitting of (2) to the Surface Reference Data Center 
(SRDC) analyses described below. The third step is 
to set the “constants” H and S from data for each of 
the input single-source datasets by comparison against 
the SRDC and GPCC analyses and tropical Pacific 
atoll gauge data (Morrissey and Green 1991). Due to 
the form of (2), the computed value of H actually ac- 
counts for multiplicative errors in N. and the condi- 
tional rain rate approximation (the term in square 
brackets) in addition to H. It is important to use 
algorithm-determined precipitation to set the constants 
because the random error for a particular technique 
should reflect sampling both of the true rain field and 
of algorithm-dependent fluctuations about the true 
field. 
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Early work on the error problem revealed that it 
was hard to use the absolute error estimates to study 
fluctuations in data quality, due to absolute error’s 
strong dependence on rain rate. According to (2), a 
varies as r 05 at low r and as r 075 at high values. 
Relative error has a similar flaw, behaving as r _05 and 
f-° 25 at low and high r, respectively. Consequently, 
for display purposes we defined an additional variable, 
quality index (QI), in which (2) was inverted to give 
N. as a function of error, and H and S were set to the 
values estimated for gauges. The units of QI are 
“equivalent gauges,” the approximate number of 
gauges required to produce the error in question (for 
the given rain total). Qualitatively, QI values near zero 
are poor and those over 10 are good. 


precipitation field, which has the sampling of the geo- 
synchronous data and the bias of the instantaneous mi- 
crowave estimates. Verification against rain gauge 
analyses over water and land and subjective exami- 
nation of the resulting maps and zonally averaged 
fields indicate that known biases in the GPI in the 
subtropics and over land are reduced using this ad- 
justment approach and that the AGPI estimates are 
superior to either the microwave or the GPI estimates 
alone (Adler et al. 1993, 1994). 

The multisatellite (MS) estimate is formed from the 
three satellite sources (see Fig. 1 ): AGPI estimates 
where available (40°N-S), the weighted combination of 
the microwave estimate and the microwave-adjusted 
low-orbit IR elsewhere in the 40°N-S belt, and the mi- 


3. Combination method summary 

The GPCP combination method is designed to use 
the strengths of each input dataset objectively to pro- 
duce merged global monthly precipitation fields that 
are superior to any of the individual datasets. The 
merger technique is an enhancement of the satellite- 
gauge-model (SGM) technique described in Huffman 
et al. (1995). The technique is outlined in Fig. 1 . The 
resulting suite of precipitation products allows users 
with different requirements to choose different prod- 
ucts. Each precipitation product has an associated er- 
ror estimate that makes it possible for users to assess 
the utility of the estimates. 

As shown in Fig. 1 the microwave estimates are 
approximately time- and space-matched with geo-IR 
observations to derive a microwave/IR calibration 
ratio for each grid box (Adler et al. 1993). In the 
present case the “matched” GPI is found by comput- 
ing the GPI from a month of 3-h geo-IR data that are 
closest in time to the nominal SSM/I overpass time. 
The ratio is controlled to prevent unstable answers, 
and it is smoothly filled in regions where the SSM/I 
is missing but the geo-IR is available. Alternately, in 
regions of light preciptation an additive adjustment is 
computed that is the difference between smoothed 
SSM/I and GPI values when the SSM/I is greater, and 
zero otherwise. In regions lacking geo-IR data, such 
as the Indian Ocean sector, the low-orbit IR contained 
in the GPCP merged IR dataset is adjusted using a 
smoothly varying interpolation of the microwave/geo- 
IR adjustment ratio. The spatially varying arrays of 
adjustment coefficients are then applied to the full set 
of GPI estimates, producing the adjusted GPI (AGPI) 



Fig. I. Flow chart of satellite-gauge-model precipitation 
combination technique. 
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crowave outside of 40°N— S. The combination weights 
are the inverse error variances of the respective esti- 
mates. Such weighted combination of microwave and 
microwave-adjusted low-orbit IR is done because the 
low-orbit IR does not have sufficient sampling to 
warrant the adjustment scheme used for the geo-IR. 

The satellite/gauge (SG) estimate is computed in 
two steps (see Fig. 1). First, for each grid box over 
land the multisatellite estimate is muliplied by the ratio 
of the large-scale (5 grid boxes x 5 grid boxes) aver- 
age gauge analysis to the large-scale average of the 
multisatellite estimate. Alternatively, in low- 
precipitation areas the difference in the large-scale 
averages is added to the multisatellite value when the 
averaged gauge exceeds the averaged multisatellite. 
This keeps the bias of the SG close to the (presum- 
ably small) bias for the gauge analysis on a regional 
scale, even while allowing the multisatellite estimate 


to provide important local detail. Second, the gauge- 
adjusted multisatellite estimate and the gauge analy- 
sis are combined with inverse-error-variance 
weighting. 

Figure 2 provides an example of the GPCP SG 
estimate. Over oceans in the latitude belt 40°N-S 
(except over the Indian Ocean) the estimate is prin- 
cipally based on the AGPI and shows its characteris- 
tic smoothness. Other oceanic regions are noisier, 
reflecting the lower sampling in the SSM/I. The land 
patterns have varying contributions by the gauge 
analysis, depending on location. Changes in data 
source are reflected as sudden changes in QI (bottom 
panel in Fig. 2). The blacked-out regions are consid- 
ered to lack usable estimates from any of the sources. 
This quality control decision is objective and is based 
on factors such as the absence of SSM/I data (at very 
high latitudes), “high” coverage by ice and snow in the 
SSM/I estimate, and a total ab- 



Fig. 2. Satellite-gauge (SG) estimate of precipitation (top, in mm day 1 ) and quality index 
(bottom, in equivalent gauges) for August 1987. Blacked-out areas denote regions with no 


estimate. 


sence of gauge reports. Note the 
relative lack of seams in the pre- 
cipitation field between regions 
with different data coverages. 
The MS fields (not shown) appear 
very similar to the SG fields, 
partly because many oceanic ar- 
eas have no gauges, and partly 
because the gauge and MS pre- 
cipitation values are frequently 
similar. Closer inspection re- 
veals quantitative differences, 
for example, higher values in the 
SG across northern Europe, and 
the SG quality index clearly 
shows higher confidence in 
many land areas. 

4. Results 

We now examine the validity 
and utility of the SG product 
through descriptive analyses 
(section 4) and quantitative 
comparisons (section 5). In this 
section we describe the spatial 
and temporal variability in the 
SG, compare them to long-term 
means published by Jaeger (1976, 
hereafter J) and Legates and 
Willmott (1990, hereafter LW) 
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and compare the interannual variability in precipitation its a broad local maximum in the eastern Pacific ITCZ, 
associated with the El Nino-Southern Oscillation while J does not and LW exhibits higher values. An 
(ENSO) during the 8.5-yr period with that described intercomparison of various satellite-derived precipi- 
in studies based on historical 


gauge observations. 

a. Eight -year mean 

The top panel in Fig. 3 shows 
the SG precipitation averaged 
over the years 1988-95, while 
the middle and bottom panels of 
the figure contain the J and LW 
climatologies for comparison. 
The SG shows the expected 
main features, with maxima in 
the Tropics in the intertropical 
convergence zone (ITCZ) in the 
Atlantic, Pacific, and Indian 
Oceans; in the South Pacific 
convergence zone (SPCZ); and 
over tropical Africa and South 
America. Dry zones in the east- 
ern parts of the subtropical 
oceans are evident. In the mid- 
latitudes, the storm tracks in the 
Northern Hemisphere oceans 
are very distinct. In the South- 
ern Hemisphere the circumpolar 
storm track is weaker with 
maxima southeast of Africa and 
South America and a poleward 
extension of the SPCZ in the 
South Pacific Ocean. Dry areas 
over North Africa, western 
North America, and Australia 
are well defined. 

The general distribution of 
precipitation in the SG is simi- 
lar to the two climatologies, but 
with significant differences in 
detail. We should expect this, 
given different analysis tech- 
niques and resolutions (5° and 
0.5° lat-long grids for J and 
LW, respectively) for the clima- 
tologies and their reliance on 
many years of sparse ship data 
over oceans. In the SG, the 
Pacific ITCZ is narrower from 
north to south, with higher peak 
values than in J. The SG exhib- 



Fig. 3. Eight-year ( 1988-95) annual-mean precipitation in mm day -1 for SG (top), Jaeger 
climatology (J; middle), and Legates-Willmott climatology (LW; bottom). For each grid 
box, the J and LW averages only have contributions for months that have SG data. Blacked- 
out areas denote regions with no estimate. 
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tation estimates by Janowiak et al. 
(1995) showed that there are large dif- 
ferences among the satellite-based esti- 
mates in this region, which appear to be 
related to frequent precipitation not as- 
sociated with deep convection, and the 
satellite techniques’ varying abilities to 
correctly estimate the precipitation un- 
der this condition. However, better vali- 
dation data are required to establish the 
climatology in this area. West of the 
date line, the SG maximum in the ITCZ 
is more distinct than that in J but some- 
what weaker than in LW. However, the 
differences are not as large as in the 
eastern Pacific. In the SPCZ, the maxi- 
mum in the SG is weaker than in either 
climatology. 

The Northern Hemisphere storm 
tracks in the SG are more intense than in 
either climatology. In J, peak amounts 
in the Pacific Ocean storm track are 
found in a broad maximum oriented 
parallel to the west coast of northern 
North America. In the SG, this is a more 
distinctly narrow coastal/orographic 
feature. Similarly, the broad maximum 
extending westward from southern 
South America in J is not evident in the 
SG. In LW, both these features are in- 
termediate between the SG and J depic- 
tions. In the SG, the midlatitude storm 
track in the Southern Hemisphere ex- 
hibits local maxima to the south and east 
of Africa and South America that are 
not found in the climatologies. 

b. Eight-year seasonal means 

Maps of the 8-yr seasonal means 
based on the SG are shown in Fig. 4. The 
ITCZ in the Pacific and Atlantic Oceans 
is strongest in the June-August (JJA) 
and September-November (SON) periods 
with a very strong maximum in the east- 
ern Pacific Ocean in JJA. The Pacific ITCZ 
is the weakest in December-February 


Fig. 4. Eight-year (1988-95) seasonal-mean 
precipitation in mm day -1 for SG. Blacked-out 
areas denote regions with no estimate. 
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(DJF) and has a noticeable double structure in March- 
May (MAM), although the northern branch is still sig- 
nificantly stronger. The Southeast Asian monsoon is 
evident, with maximum intensity in JJA and oceanic 
rainfall maxima both east and west of India. The 
monsoon precipitation maximum shifts south- and 
eastward during SON, and reaches northern Austra- 
lia in DJF. However, the intensity of the monsoon 
system is greatest by far during the Northern Hemi- 
sphere summer season. The SPCZ is evident through- 
out the year and is strongest in DJF. While its western 
boundary changes little through the year, it does ex- 
pand to the north and east during DJF. 

The North Pacific and North Atlantic storm tracks 
are present in all seasons but have their greatest east- 
ward extent in SON and DJF. An axis of enhanced 
precipitation appears to connect the storm tracks to 
the low-latitude maxima over Central America and 
Southeast Asia in the warm sea- 
sons. In southern midlatitudes, 
the storm track is most clearly 
defined in JJA, with peak inten- 
sities in this season in the three 
oceanic maxima southeast of 
Africa, in the mid-Pacific, and 
southeast of South America. 

Pronounced minima in pre- 
cipitation are observed over the 
subtropical oceans and western 
subtropical continents in all sea- 
sons. The oceanic minima are 
strongest and most extensive 
during the summer seasons, 
while the continental minima 
exhibit movements that are re- 
lated to the north-south sea- 
sonal migration of the tropical 
continental maxima. 

The SG seasonal means ex- 
hibit good qualitative agreement 
with J over land but not over the 
oceans. Figure 5 shows the dif- 
ferences between JJA and DJF 
for both the SG and J. Over 
land, the patterns of seasonal 
change are remarkably con- 
sistent, with even minor fea- 
tures such as the DJF maximum 
over the U.S. Gulf Coast in 
agreement. Over the oceans, 
only the very largest scale fea- 


tures, such as the winter maximum in midlatitudes, 
agree and even those exhibit some inconsistencies, as 
in the western North Pacific. A substantial difference 
is seen in the eastern Pacific ITCZ, where the pro- 
nounced JJA maximum in the SG is not seen in J, 
which in fact shows a DJF maximum in part of this 
region. 

c. Full time series 

The time series of the global monthly precipitation 
rate based on the SG is shown in Fig. 6. To achieve 
globally complete fields, missing values in the SG 
monthly fields were filled with climatological precipi- 
tation values for the appropriate month and location 
according to J. The choice of fill-in is not critical as 
long as values are reasonable. For example, filling 
with forecast values produced by the operational 
European Centre for Medium-Range Weather Fore- 



Fig. 5. Extreme-season (JJA-DJF) difference in mm day' for SG (top) and J (bottom). 
For each grid box, the J average only have contributions for months that have SG data. 
Blacked-out areas denote regions with no estimate. 
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Fig. 6 . Time series of the global monthly precipitation rate computed from the 
SG with holes in the field filled with corresponding J values (top), and (J - SGM) 
difference (bottom), both in mm day 


The subtropical dry zones (precipita- 
tion < 2 mm day’ 1 ), centered near lati- 
tude 20° in each hemisphere, are most 
intense (driest) during the winter sea- 
son. During late summer in each hemi- 
sphere when the migrating tropical 
maximum is near its poleward extreme, 
a band of heavier precipitation extends 
to midlatitudes. A band of heavier 
amounts is found centered near latitude 
40° in each hemisphere throughout the 
year. In the Northern Hemisphere, ex- 
tensions of this heavy band to higher 
latitudes are seen in late summer and 
fall, but annual and interannual variabil- 
ity in intensity is small. In the Southern 
Hemisphere, no such extension to higher 
latitudes is visible. However, a modest 


casts numerical weather prediction system yielded 
values that differ by no more than 0.2 mm day' for 
any one month and average 0.06 mm day 1 higher for 
the 8-yr average. The average global total precipita- 
tion for the 8-yr period is 2.53 mm day 1 with a sub- 
stantially higher value over water than over land. The 
difference between the SG and J global means (Fig. 6, 
bottom) is nearly always positive (GPCP less) with 
an average of about 0.3 mm day" 1 . Over land the sea- 
sonal cycle is very evident, with a maximum in the 
boreal summer presumably associated with the maxi- 
mum in Southeast Asian monsoon precipitation dur- 
ing that season. 

The zonally averaged precipitation in the belt 
60°N-S during the 8.5 yr (Fig. 7) exhibits a pro- 
nounced annual cycle in the Tropics and subtropics 
but only modest interannual variability. 

Heaviest precipitation (zonal averages 
> 8 mm day 1 ) is found near 10°N in the 
late boreal summer. This feature is part 
of a belt of maxima that reaches its far- 
thest northward extent about September 
and extends farthest southward, past 
I5"S, in March. Rasmusson and Arkin 
( 1 493) described this pattern using only 
IR-based estimates (their Fig. 20), Find- 
ing the latitudinal extrema in August and 
February. The apparent continuity 
between the summer maxima in each 
hemisphere in Fig. 7 was not found by 
Rasmusson and Arkin except during the 
1986-87 warm episode. 


annual cycle, with maximum values ex- 
ceeding 4 mm day 1 in midwinter, and some inter- 
annual variability can be seen. 

d. ENSO signal 

One of the most important prospective uses of any 
analysis of global precipitation is to examine and un- 
derstand interannual changes in the distribution of 
large-scale precipitation and their relationship to 
variations in the general circulation. It has long been 
clear that a significant portion of the interannual vari- 
ability in the large-scale atmospheric circulation is as- 
sociated with the ENSO phenomenon ( Bjerknes 1 969; 
Arkin 1982). ENSO-related changes in global precipi- 
tation as inferred from rain gauge observations have 
been presented by Ropelewski and Halpert (1987, 
1989, 1996), and Janowiak and Arkin (1991) and 



-/v !$ 


ZONAL AVERAGE RAINFALL (mm/day) 

Fig. 7. Time series of the zonal-average SG precipitation in mm day 
the belt 60°N~S. 
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Arkin et al. (1994) have described the ENSO signal 
in precipitation estimated from satellite observations. 
In this section, we will describe the precipitation 
changes associated with changes in the state of ENSO 
during the period of analysis and compare them to 
those that occurred during previous episodes. 

The period covered by the GPCP analysis was 
characterized by a warm episode that began in late 
1986, a cold episode during 1988-89, and a warm 
episode during 1991-93 (Arkin et al. 1994; Kousky 
1987). The latter was most pronounced during the 
DJF seasons of 1991/92 and 1992/93, and less clear 
during the JJA seasons (Kousky 1993; Mo and Wang 
1994). 

The top panel of Fig. 8 shows the difference in pre- 
cipitation between a warm episode DJF (1991/92) and 
a cold episode DJF (1988/89). Large differences are 
found across the equatorial Pacific Ocean, extending 
into the midlatitudes of the southern Pacific and 
across the Maritime Continent 
into the Indian Ocean. Amounts 
are greater during the warm epi- 
sode in the Central Pacific and 
the east Pacific ITCZ, and less 
farther west. The SPCZ appears 
to be displaced toward the north 
and east during the warm epi- 
sode. Coherent-appearing fea- 
tures of lesser amplitude can be 
seen elsewhere; greater amounts 
during the warm episode in the 
subtropical southern Indian 
Ocean, at the North American 
terminus of the North Pacific 
storm track, over the U.S. Gulf 
Coast, and over portions of Ar- 
gentina and Uruguay; and lesser 
amounts during the warm epi- 
sode over southern Africa and 
the adjacent Indian Ocean, the 
central North Pacific, and equa- 
torial South America and the 
adjacent Atlantic Ocean. 

Differences between warm 
and cold episodes during the 
JJA season are slightly more 
difficult to describe. The JJA 
season of 1992, while nominally 
part of an extended warm epi- 
sode, actually was characterized 
by near-normal sea surface 


temperatures in the equatorial Pacific, with some 
below-normal values in the region of the climatologi- 
cal cold tongue (Mo and Wang 1994). The JJA sea- 
son of 1987 was more representative of canonical 
warm episode conditions (Janowiak and Arkin 1991), 
but the GPCP Version 1 analysis is not available dur- 
ing the first part of that season. We will use both July- 
September (JAS) 1987 and JJA 1992 as examples of 
warm ENSO conditions during the boreal summer, 
while JAS 1988 and JJA 1988 will serve as the cor- 
responding cold episode examples. The difference 
map for JAS 1987-JAS 1988 (bottom panel of Fig. 8) 
exhibits a band of heavier (lighter) precipitation dur- 
ing the warm (cold) episode extending the entire 
width of the tropical Pacific Ocean and into South- 
east Asia to near 90°E. In the eastern Pacific, this band 
appears to represent an enhanced and southward-dis- 
placed ITCZ, with some differences of opposite sign 
along its northern boundary. From 150°W to 140°E, 



Fig. 8. (El Niiio-La Nina) differences in mm day 1 computed from SG fields for (DJF 
1991/92-DJF 1988/89) (top) and (JAS 1987-JAS 1988) (bottom). 
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Fig. 9. Locations of SRDC test site cells. 


the increased precipitation straddles 
the equator, and farther west it trends 
slightly northward across the Philippines 
and Indochina. Similar features were 
found in the JJ A 1992-JJA 1987 differ- 
ence (not shown). Decreased precipita- 
tion during the warm episodes is found 
to the south of this band from 160° to 
past 90°E, although this is much more 
pronounced when JAS 1987 is used. 

Decreased precipitation is found in the 
western North Pacific from 20° to 40°N, 
in the central South Pacific, and along 
the Atlantic ITCZ and into the Caribbean and Gulf 
of Mexico. 

There is good agreement between the SG patterns 
and those found in composite studies using gauges 
(Ropelewski and Halpert 1987, 1989, 1 996; Schneider 
and Rudolf 1994) and anomalous low-level conver- 
gence (Rasmusson and Carpenter 1982; Schneider and 
Fleer 1989). Differences presumably result from limi- 
tations in data coverage (gauge studies) or represen- 
tativeness (convergence studies) and from the 
comparison of composites to individual pairs of epi- 
sodes. The similarity of the present results to these in- 
dependent analyses gives us confidence that the GPCP 
combined dataset is accurately identifying the prin- 
cipal manifestations of interannual variability in pre- 
cipitation during the period, at least qualitatively. The 
results also provide interesting new information on 
oceanic precipitation variability and its relationship 
to variability over land. 


5. Sample validation results 

Even though the limited applicability of prior 
datasets (noted in the introduction as a motivation for 
the project) prevents complete global validation of the 
SG precipitation product, it is possible to make some 
quantitative comparisons. The GPCP Surface Refer- 
ence Data Center (SRDC) at the National Climatic 
Data Center (NCDC) has constructed monthly area- 
average precipitation for fifteen 2.5° x 2.5° cells lo- 
cated in five test site areas (Fig. 9) during the period 
July 1 987-December 1991. The grid cells are located 
almost exclusively over land in regions of fairly uni- 
form gauge spacing to avoid extensive data voids. For 
the purposes of this comparison, the SRDC values 
were multiplied by the same bias-correction values 
that were applied to the GPCC. 

The SRDC estimate is calculated from quality- 
checked daily gauge data using the PRISM analysis 


Table! .Monthly numbers of gauges used in the SRDC and gauge analyses for each test-site area box. 


Site 


25th 

percentile 


SRDC 

50th 

percentile 


75th 

percentile 


25th 

percentile 


GPCC 

50th 

percentile 


75th 

percentile 
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program (Daly et al. 1994). PRISM 
is based on linear regressions of 
gauge precipitation and terrain 
(slope and elevation). The regres- 
sion equations are used with a very 
fine grid of elevations and slopes 
(obtained from a digital elevation 
model) to infer precipitation over 
the entire grid. 

Although both SRDC and GPCC 
provide gauge analyses, the datasets 
are quite different. The SRDC 
analyses are based on more gauges 
than the GPCC analyses (Table 1 ). 

As well, the analysis techniques take different ap- 
proaches. Very few of the GPCC and SRDC gauges 
are collocated, and the SRDC analyses do not change 
significantly when GPCC sites are excluded from 
bKDC analyses. Except for these few cases, the GPCC 
and SRDC analyses are independent in the sense that 
they are not based on data from the same gauges. In a 
deeper sense the two analyses cannot be independent 
since correlation distances for monthly averages ex- 
ceed the typical interstation spacing in the SRDC 
dataset. 

Two error measures are available for the SRDC 
analysis. The first is a confidence interval that repre- 
sents the fit of the gauge data to the linear functions 
used to interpolate precipitation. This conservative 
estimate of the uncertainty in the relation between 
precipitation and elevation is most applicable in 
mountainous areas. The average of the 70% predic- 
tion confidence interval is ±0.95 mm day-'. The sec- 
ond error measure is the cross-validation error, the 
average difference between an observed gauge value 
and the regression-estimated value at the gauge lo- 
cation. The cross-validation error ranges from -0 92 
to 0.36 mm day ' in the SRDC dataset, with an aver- 
age of -0.03 mm day 1 . 

The SRDC analysis was compared to the GPCP 
Version 1 MS, gauge, and SG analyses, producing the 
summary statistics in Table 2. Scatterplots for the in- 
dividual matchups are displayed in Fig. 10. As ex- 
P ^ n ‘ he M S field (which is the AGPI estimate for 
f test s ' tes > except Canada) shows a relatively 

large root-mean-square (rms) error. The gauge is 
somewhat surprisingly close to the SRDC despite the 
small number of gauges. This correlation between the 
two gauge-based datasets is not due to duplicate gauge 
ata, which occur only occasionally; it indicates the 
importance of a good analysis system. Although well 


SoX 2 , STKSSE a " “ ,,S - — - »*. *. MS. g ,u E , «, 


Technique 

Bias 

(mm day -1 ) 

Average 
absolute diff. 
(mm day -1 ) 

Rms error 

(mm day -1 ) 

Correlation 

coefficient 

m 

MS (multisatellite) 

-0.190 

1.449 

2.048 

0.545 

Gauge (GPCC) 

0.222 

0.607 

1.151 

0.853 

SG (satellite gauge) 

0.121 

0.665 

1.081 

0.864 


correlated, the gauge analysis has a positive bias 
which is improved in the SG. 

Figure 1 0 shows fewer large errors for the SG than 
or the gauge but slightly more spread close to the 1 : 1 
line. Quantitatively, this is reflected in lower rms er- 

™ r ,^° r * h ^ SG but higher average absolute error 
(table 2). One constraint on improvement is that the 
satellite values are adjusted to the large-scale gauge 
bias before combination. This step works as intended 
m pulling the SG bias toward the gauge bias, but it 
has the side effect of smearing steep gradients. It 
should also be noted that the SRDC validation results 
show seasonal and regional variations, so that the 
average results shown here depend on the locations 
and sizes of test sites. Thus, we are of the opinion that 
the performance of the SG combination is acceptable 
even with the current strong approximations in esti- 
mating error. 

On the whole, the estimated rms errors track with 

™ S errors (i e ’ differences from 
SRDC) for the various fields as a function of rain to- 
tal. However, the estimates tend to be lower than the 
“ 311 but the gauge ’ which was calibrated from 
T S' WC ,nter P ret this res uK as indicating that 
,. e SI ; D i C test sltes hav e higher variability in the sat- 
ellite fields than the average for all of the land areas 
in the zone 40°N-S (which were used to develop the 
coefficients). About half of the test sites are in coastal 
areas, which we often observe to have higher variabil- 
ity in precipitation than continental interiors or open 
ocean. r 


6. Concluding remarks 

The GPCP has released a new set of global precipi- 
tation estimates based on the sequential combination 
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of microwave, IR, and gauge data. At each stage, bi- 
ases were removed where possible, and the combina- 
tion is tailored to the types of data. The dataset covers 
the period July 1987 through December 1995 on a 
monthly 2.5° x 2.5° lat-long grid. Each input and 
combination field is a separate product, and estimates 
of the random part of the error are provided with each 
precipitation field. These (spatially and temporally 
varying) errors allow users to assess the utility of the 
grid values for their own application. Such applica- 
tions might include validation of numerical climato- 


F,g. 10. Scatterplots of (a) MS, (b) gauge, and (c) SG versus 
SRDC, all in mm day' 1 for all cells and months. 

logical models, calibration of hydrological models, 
and benchmarking of experimental rainfall estimation 

techniques. , 

Time- and space-averaged Version 1 combined 

satellite-gauge fields show fair agreement with stan- 
dard climatologies, for example yielding a global 
annual-average precipitation rate of 2.5 mm ay , 
which corresponds remarkably well to previous cli- 
matological estimates. On seasonal and regional scales 
systematic differences start to appear. Validation over 
land shows that the Version 1 multisatellite combina- 
tion provides useful information, but the addition of 
even a few gauges greatly improves the estimate. 

Four major themes are likely for future releases o 
the GPCP combination. Extension to earlier perio s 
that lack SSM/I data is needed to fulfill the mandate 
to start in 1986. Error estimation needs to be improved 
and refined. A whole class of users would benefit 
from globally complete fields. A research version of 
globally complete fields is already in development, 
based on the use of estimates of precipitation from 
four-dimensional data assimilation models. Fina y, 
upcoming satellites such as Tropical Rainfall Mea- 
surement Mission hold great promise for refining the 
calibrations of the SSM/I and GPCP estimates. 

The GPCP Version 1 Combined Precipitation Data 
Set is archived at World Data Center A (WDC A, 
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Asheville, North Carolina), which is collocated with 
NCDC. The most current release is Version la. Datasets 
may be obtained by contacting A. McNab (704-271- 
4592; NCDC, Federal Buildling, 151 Patton Ave., 
Asheville, NC 28801-5001), by Internet access to the 
WDC-A home page (URL http://www.ncdc.noaa.gov/ 
wdcamet . html#GPCP ) , or by anonymous FTP. For the 
last, FTP to ftp.ncdc.noaa.gov, log in as “anonymous” 
using your complete e-mail address as the password, 
“cd” to subdirectory pub/data/gpcp/versionl, set bi- 
nary transfer mode, and “get” the file readme.txt. Files 
are binary, providing 1 yr of data for a particular data 
field (a variable from a technique). 
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